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Abstract 

Background: Increased low-density lipoprotein cholesterol (LDL) concentration is associated with increased risk of 
coronary heart disease (ChHD) but a substantial risk of cardiovascular disease often remains after LDL concentrations 
have been treated to target. Apolipoprotein B (apo B) is the nnajor apolipoprotein contained within atherogenic 
lipoproteins such as LDL, and apo B is a nnore reliable indicator of cardiovascular risk than LDL concentration. 

Aim and methods: Our aim was to develop a formula for calculating apo B using lipid biochemistry 
measurements that are commonly available in clinical practice. We examined the clinical and laboratory data from 
73,047 Koreans who underwent a medical health check that included apolipoprotein B concentration. The study 
sample was randomly divided into a training set for prediction model building and a validation set of equal size. 
Multivariable linear regression analysis was used to develop a prediction model equation for estimating apo B and 
to validate the developed model. 

Results: The best results for estimating apo B were derived from an equation utilising LDL and triglyceride (TG) 
concentrations [ApoB = -33.12 + 0.675''LDL+ 1 l.QS^'Inftg)]. This equation predicted the apo B result with a 
concordance correlation coefficient (CCC and 95%Cls) = 0.936 (0.935,0.937)). 

Conclusion: Our equation for predicting apo B concentrations from routine analytical lipid biochemistry provides a 
simple method for obtaining precise information about an important cardiovascular risk marker. 
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Background 

The association between increased concentrations of 
low-density lipoprotein cholesterol (LDL) and increased 
rates of premature coronary heart disease has been 
clearly demonstrated [1-8]. Current recommendations 
for the management of dyslipidemia are largely based on 
treatments to decrease LDL concentration [5,9-14]. 
However, a significant residual risk of cardiovascular dis- 
ease (CVD) often remains after low-density lipoprotein 
cholesterol levels have been treated to target [15-20]. 



* Correspondence: kcmd.sung@samsung.com 
^Equal contributors 

^Division of Cardiology, Department of Medicine, Kangbuk Samsung 
Hospital, Sungkyunkwan University School of Medicine, #108, Pyung Dong, 
Seoul, Jongro-Ku 1 10-746, Republic of Korea 
Full list of author information is available at the end of the article 



Apolipoprotein BlOO (apoB) is the structural protein for 
atherogenic lipoproteins and facilitates the transporting of 
lipid from the liver to peripheral tissues [15,21-23]. A sin- 
gle apo BlOO molecule is present in all major atherogenic 
particles derived from the liver (very low density lipopro- 
tein (VLDL), intermediate density lipoprotein (IDL) and 
LDL). Consequently, measurement of apoBlOO provides 
direct information as to the number of circulating athero- 
genic particles [23]. Apo BlOO concentration is a better 
measure of LDL particle number concentration and is a 
more reliable indicator of risk than LDL concentration 
[22,24,25]. Thus, addition of apo BlOO concentration to 
the routine lipid profile could improve identification of 
patients at risk of CVD and could improve management 
of those patients who are receiving lipid lowering therapy 
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[24-31]. Apo BlOO measurement also improves CHD risk 
prediction in people with diabetes or metabolic syndrome 
[24,32] and Apo BlOO may provide a better assessment of 
on-treatment residual risk (than LDL) providing support 
for the notion that addition of apo BlOO measurement to 
the routine lipid panel would improve patient manage- 
ment [26,31]. 

Apo BlOO can be measured by commercial immuno- 
assay [33] but assays are time-consuming and costly 
[34]. Although an algorithm for estimating apo BlOO 
has previously been developed by Hermans et al. [23], 
this algorithm was developed in 45 people with dia- 
betes from a Western population. Thus, the aim of our 
study was to develop an algorithm for estimating the 
apo BlOO concentration from easily measured para- 
meters; e.g. age, body mass index (BMI), low desntiy 
lipoprotein cholesterol (LDLc), high density lipopro- 
tein cholesterol (HDLc), triglyceride (TG) and total 
cholesterol (TC) concentrations. 



Methods 

A total of 73,047 apparently healthy subjects were 
recruited for the study. The mean age was 41.73 ± 8.4 
years, [n = 44,118 men (41.9 ± 8.1-years) and n = 28,929 
women (41.4 ± 8.7-years)]. Subjects participated in a rou- 
tine health check-up program that was held at the 
Health Promotion Center of Kangbuk Samsung Hospital, 
Sungkyunkwan University School of Medicine, Seoul, 
Korea in 2008. The medical health checkup program 
was developed to improve the health of employees. Most 
subjects were employees, or family members, from vari- 
ous industrial companies across the country. The cost of 
medical examinations was predominantly paid for by the 
employers, and most subjects underwent a health check 
annually or biannually. The study protocol conformed to 
the ethical guidelines of the 1975 Declaration of Helsinki 
as reflected by a priori approval from our institutions 
Human Research Committee. 

The health check consisted of a full medical history 
and comprehensive blood test evaluation. Participants' 



height and weight were measured barefoot and in light 
clothing. BMI was calculated as weight in kilograms 
divided by height in meters squared. Laboratory exami- 
nations were obtained after an overnight fast. An enzym- 
atic calorimetric test was used to measure TC and TG 
concentrations. The selective inhibition method was 
used to measure HDLc, and a homogeneous enzymatic 
calorimetric method was used to measure the concentra- 
tion of LDLc (Advia 1650 Autoanalyzer, Bayer Diagnostics, 
Leverkusen, Germany). Apo BlOO and apoAl concentra- 
tions were determined by rate nephelometry (IMMAGE 
system; Beckman Coulter). 

Descriptive statistics for continuous variables are pre- 
sented as means, standard deviations (SDs), medians and 
inter-quartile ranges (Ql, Q3). Categorical variable are 
presented as proportions (percentages). 

The study sample was randomly divided into a training 
set for prediction model building and a validation set of 
equal size. Multivariable linear regression analysis was 
used to develop a prediction model equation and to val- 
idate the developed model. Natural log transformation 
was used to normalize the distribution of HDLc, TG, age 
and BMI. Analysis of residuals was used to check 
assumptions for multivariable linear regression model- 
ing. The accuracy of the prediction model equation was 
evaluated using concordance correlation coefficient 
(CCC) analysis (Lin (1989)) that allowed comparison 
between prediction modeling results and the direct 
biochemical measurement of apo BlOO. In all tests, 
j^-values < 0.05 were considered significant. Statistical 
analyses were performed using SAS 9.1.3 (SAS Institute 
Inc, Gary, NC) and R 2.13.2 (Vienna, Austria). 

We conducted subgroup analyses by sex, glucose (7.0 
mmol/1 or 126 mg/dl), BMI (25kg/m^) and apoB quar- 
tile, in order to examine whether the derived equation 
was appropriate for specific subpopulation. 

Results 

The characteristics of subjects in the model building 
subsample and the validation subsample were similar 



Table 1 Characteristics of the whole cohort, and the model building and validation subsets 





Total (n = 73047) 


Model building (n = 36523) 


Validation set (n = 36524) 


Age (years) 


41.73 ±8.38 


41. 7 ±8.4 


41. 7 ±8.4 


Gender (M/F) 


44118/28929 


22034/14489 


22084/14440 


Total cholesterol (mg/dL) 


195.1 7 ±33.79 


195.1 2 ±33.89 


195.22 ±33.70 


TG (mg/dL) 


127.55 ±86.80 


127.40 ±86.25 104(73,1 56) 


127.70 ±87.341 05 (73,156) 


HDLc (mg/dL) 


55.20 ±12.73 


55.1 9± 12.73 


55.20 ±12.74 


LDLc (mg/dL) 


11 0.63 ±29.43 


11 0.62 ±29.48 


11 0.64 ±29.37 


ApoBlOO (mg/dL) 


97.52 ±23.97 


97.53 ± 24.04 


97.50 ±23.0 


BMI (kg/m^) 


23.63 ±3.1 2 


23.64 ±3.1 2 


23.63 ±3.11 



Data are expressed as mean±SD or median [interquartile range], unless indicated otherwise. TG: triglyceride, HDLc: High density lipoprotein cholesterol, LDLc: low 
density lipoprotein cholesterol, Apo BlOO: apolipoprotein BlOO, BMI: body mass index. 
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Table 2 Models for predicting apoBlOO concentration 





Prediction equation 


P-value 


R-square 


CCC[95% CI] 


1 1 Pll ^ 
1 . LULC 


ApoB100 = 


16.177816 + 0.735235*LDLc 


^ ^^^^^^^ 
<.UUU 1 


81 .1 % 


0.90 


[0.890-0.898] 


2. LDLc, age 


ApoB100 = 


-24.77 + 0.72*LDLc + 1 1 .43*ln(age) 


<.0001 


81.9% 


0.90 


[0.899-0.903] 


3. LDLc, BMI 


ApoB100 = 


-49.1 3 + 0.70*LDLc + 21 .8rin(BMI) 


<.0001 


82.4% 


0.90 


[0.902-0.905] 


4. LDLc, BMI, age 


ApoB100 = 


-78.91 + 0.696*LDLc + 9.8rin(L|-0|) + 1 9.98*ln(BMI) 


<.0001 


83.0% 


0.91 


[0.905-0.909] 


5. LDLc, TG 


ApoB100 = 


-33.1 2 + 0.675*LDLc + 1 1 .95*ln(tg) 


<.0001 


88.0% 


0.94 


[0.935-0.937] 


6. LDLc, TG, age 


ApoB100 = 


-59.40 + 0.67*LDLc + 1 1 .63*ln(tg) + 7.68*ln(age) 


<.0001 


88.3% 


0.94 


[0.937-0.939] 


7. LDLc, TG, BMI 


ApoB100 = 


-35.99 + 0.67*LDLc + 1 1 .84*ln(tg) + 1.11 *ln(BMI) 


<.0001 


88.0% 


0.94 


[0.935-0.937] 


8. LDLc, TG, age, BMI 


ApoB100 = 


-59.53 + 0.67*LDLc + 1 1 .62*ln(tg) + 7.68*ln(age) + 0.05*ln(BMI) 


<.0001 


88.3% 


0.94 


[0.937-0.940] 



^Multiple linear regression analyses were used to develop a prediction equation. 
^CCC indicates concordance correlation coefficient. 
^Cl indicates confidence interval. 



with no significant differences between groups (Table 1). 
Subjects with high glucose (>7.0 mmol/1, or >126 mg/dl) 
were only 932(2.6% of total subjects). Variables entered 
into the prediction equation were LDLc, HDLc, TG, age 
and BMI. LDLc, TC, In(TG), In(BMI) and In(age) were 
each associated with apo BlOO, whereas apo B was not 
associated with HDLc. 

We developed equations for predicting apo BlOO from 
the results of multivariable regression modeling and 
compared apo BlOO concentrations obtained by direct 
measurement with apo BlOO estimates from various pre- 
diction model formulae (Table 2). An equation including 
LDLc, In(TG), In(age) and In(BMI), produced the highest 
results and the highest CCC. However, an equation 
that included LDLc, In(TG) and In(Age) produced a 
higher F -statistic than one that also included In(BMI). 
However, since BMI added little to an equation that 
included just LDLc, and TG, we tested the equation 
ApoBlOO = -33.12 + 0.675*LDL + 11.95nn(tg)' in the val- 
idation data-set. In this data-set the CCC was 0.936(95% 
CI(0.935-0.937)). 

We also estimated apo BlOO using published predic- 
tion equations [23,33] and compared predicted values 



from these equations with concentrations obtained from 
direct biochemical measurements. Figures la, b and c 
show the scatter plots for the relationships between pre- 
dicted and observed apo B measurements for the devel- 
oped equation, and for the two published equations. We 
compared CCC values for our equation ApoBlOO = 
-33.12 + 0.675*LDL+11.95nn(tg) with the two pub- 
lished equations (Table 3). We also compared the three 
equations stratified by sex, glucose (7.0mmol/l or 126 
mg/dl) and BMI (>25kg/m^) thresholds, and apo BlOO 
quartiles (Table 4). 

The CCC for obese individuals with a BMI > 28kg/m^ 
was 0.914(95% CI(0.908-0.920)). We also tested the for- 
mula also in subjects with an atherogenic lipid profile, 
namely in those with a TG > 150 mg/dl + HDLc < 40/<50 
mg/dl. In individuals with TG > 150 & HDLc < 50 (fe- 
male), the CCC was 0.923(95% CI(0.917-0.929)). In indi- 
viduals with TG > 150 & HDcL < 40 (male), the CCC was 
0.932(95% CI(0.924-0.939)). We also calculated the per- 
formance of the proposed equation using estimated 
LDLc from Friedewalds equation. The CCCs are 0.920 
(95% CI(0.919-0.922)) in the training data-set and 0.922 
(95% CI(0.920-0.924)) in the validation data-set. 
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Table 3 CCC values for the developed equation^ and published equations 







CCC 


[95% CI] 


Equation!: ApoBl 00 = 


-33.1 2 + 0.675*LDLc + 1 1 .95*ln(tg) 


0.94 


[0.93 0.94] 


Equation 2: ApoBl 00 = 


20.67944 + 0.551 61 4*non HDLc, where non HDLc = total C-HDLc 


0.912 


[0.91 0.91] 


Equation 3: ApoBl 00 = 


6.3 + 0.65*non HDLc, where non HDLc = total C-HDLc 


0.93 


[0.93 0.94] 



Equation2: ApoBl 00 = 20.67944 + 0.551 61 4*non HDLc, where non HDLc = total C-HDLc. 
Equation 2 from Krishnajl KulkarnI (29). 

Equations: ApoB = 6.3 + 0.65^non HDLc, where non HDLc = total C-HDLc. 
Equation 3 from Hermans et al(20). 



Discussion 

Measurement of apo BlOO concentrations helps cardio- 
vascular risk prediction but unfortunately to date apo 
BlOO assays are often not available because they are con- 
sidered expensive and time consuming. Consequently, apo 
BlOO measurements are not readily available to clinicians. 
From a very large cohort of subjects that are representa- 
tive of a general Asian population, we have developed a 
very simple algorithm for estimating apo BlOO concentra- 
tion that utilizes only LDLc and fasting triglyceride con- 
centrations. Although more complex equations fitted the 
data slightly better for predicting apo BlOO; the very sim- 
ple algorithm did not compromise precision, compared 
with the more complex equations. We have also shown 
that the simple algorithm is valid in important sub groups 
of people that included obese subjects, subjects with 
increased plasma glucose concentrations and subjects 
with an atherogenic lipid profile. The derived formula is 
also appropriate if LDLc is estimated from Friedewalds 
equation. 

The CCCs for apoBlOO in Q2 and Q3 were lower than 
for the other two quartiles. To evaluate the reason for 
the lower CCC results in Q2 and Q3, we compared the 
characteristics of the relationship between apoBlOO and 
LDLc in each quartile. In the total data set, the relation- 
ship between apoBlOO and LDLc showed a clear positive 
linear relationship, whereas in Q2 and Q3, the relation- 
ship was not linear In Q2 and Q3 the data points were 



concentrated around the middle of the distribution, ra- 
ther than being spread evenly along the regression line. 
Consequently, the scatter of data in these middle two 
quartiles did not fit the regression line as well as in Ql 
and Q4. Thus, this finding may limit the usefulness of 
our formula in these middle two quartiles. However, our 
equation was developed for the whole population and 
not just for the subjects in the 2^^ and 3^*^ quartile. We 
reason that it is more important to fit the formula to the 
whole population and not to a specific subgroup within 
that population. 

Conclusion 

In conclusion, we have developed an algorithm in a large 
Asian population to derive an estimate of apoBlOO con- 
cetnrations that utilizes only measurement of LDLc and 
triglyceride concentrations from a fasting plasma sam- 
ple. We are unable to comment as to whether the algo- 
rithm is valid in a Western population. 
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Table 4 Comparative CCC results for the developed equation and two published equations, stratified by sex, glucose 
and BMI thresholds and apo B quartile 







Male 




Female 


glucose >126mg/dl Glucose <126mg/dl 


BIVll>25kg/ 


BMI<25kg/m^ 




CCC 


[95% CI] 


CCC 


[95% CI] 


CCC 


[95% CI] CCC 


[95% CI] 


CCC [95% CI] CCC [95% CI] 


Equation! 


0.92 


[0.92 0.9S] 


0.94 


[0.9S 0.94] 


0.9! 


[0.90 


0.92] 0.94 


[0.9S 0.94] 


0.92 [0.9! 


0.92] 0.9S [0.9S 0.94] 


Equation2 


0.90 


[0.90 0.90] 


0.9! 


[0.9! 0.9!] 


0.84 


[0.82 


0.85] 0.9! 


[0.9! 0.92] 


0.89 [0.88 


0.89] 0.9! [0.9! 0.9!] 


Equations 


0.92 


[0.92 0.92] 


0.94 


[0.94 0.94] 


0.84 


[0.82 


0.86] 0.94 


[0.94 0.94] 


0.9! [0.9! 


0.9!] 0.94 [0.94 0.94] 


Apolipoprotein B Quartile 








Q1(<80.24) 


Q2(80.24< Q2< 90.95) 


Q3(90.95< Q3<1 12.86) 


Q4(>1 12.86) 






CCC 




[95% CI] 


CCC 




[95% CI] 


CCC 


[95% CI] 


CCC [95% CI] 


Equation! 




0.7! 




0.70 0.72 


0.25 




0.2S 0.26 


0.56 


0.5S 0.56 


0.75 0.74 0.76 


Equation2 




0.5! 




0.50 0.5! 


0.22 




0.20 0.2S 


0.56 


0.55 0.57 


0.72 0.7! 0.72 


Equations 




0.66 




0.65 0.67 


0.2S 




0.2! 0.24 


0.5S 


0.56 0.54 


0.80 0.79 0.8! 



N = 36,524 participants who were randomly divided Into a validation set. 
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